Exploring MWEs for Knowledge Acquisition from Corporate Technical Documents
نویسندگان
چکیده
High frequency can convert a word sequence into a multiword expression (MWE), i.e., a collocation. In this paper, we use collocations as well as syntactically-flexible, lexicalized phrases to analyze ‘job specification documents’ (a kind of corporate technical document) for subsequent acquisition of automated knowledge elicitation. We propose the definition of structural and functional patterns of specific corporate documents by analyzing the contexts and sections in which the expression occurs. Such patterns and its automated processing are the basis for identifying organizational domain knowledge and business information which is used later for the first instances of requirement elicitation processes in software engineering.
منابع مشابه
Corporate Memory: A framework for supporting tools for acquisition, organization and maintenance of information and knowledge
In this paper we describe corporate memory which can support multiple knowledge acquisition, organization and maintenance tools. Memory holds and manages documents and related information and knowledge processed and created by such tools. Tools can work with several types of data such as documents, data in relational database and semantic data. Such diversity of information is needed due to dif...
متن کاملFrom Natural Language Documents to Sharable Product Knowledge
A great part of a company's product knowledge is often only available as natural language documents. The disadvantages of this source of information are its informal structure, the lack of actuality and general interdepartmental accessibility. We propose a way to integrate knowledge about a speciic product contained in technical documentation into corporate memory. By analysing e.g. maintenance...
متن کاملA Method for Semi-Automatic Ontology Acquisition from a Corporate Intranet
The focused access to knowledge resources like intranet documents plays a vital role in knowledge management and supports in general the shifting towards a Semantic Web. Ontologies act as a conceptual backbone for semantic document access by providing a common understanding and conceptualization of a domain. Building domain-specific ontologies is a time-consuming and expensive manual constructi...
متن کاملAutomated Acquisition of Multiword Expressions for Robust Deep Parsing
In this presentation, I mainly deal with automated acquisition of Multiword Expressions as a means of enhancing robustness of lexicalised grammars used in robust deep parsing for real-life applications. Specifically, I begin by taking a closer look at the linguistic properties of MWEs, in particular, their lexical, syntactic, as well as semantic characteristics. The term Multiword Expressions h...
متن کاملIndexing Corporate Memories through Ontologies
In the context of Knowledge Management, we carry out a Corporate Memories (CM) project for the Company CIRTIL. Our purpose is to focus on the modelling of the application domain. It is built as a domain ontology with a structure supporting a semantic model based on ontological relationships. In this paper we, present our S model which permits to model knowledge and to index documents. We also s...
متن کامل